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A SYSTEM AND METHOD FOR DIGITAL WATERMARKING IN A 
CALIBRATED PRINTING PATH 

RELATED CASES 

Reference is made to commonly-assigned copending U.S. Patent 

Application Serial No. (Attorney. Docket No. D/A2456), entitled A 

SYSTEM AND METHOD FOR DIGITAL WATERMARKING IN A CALIBRATED 
PRINTING PATH, by Ricardo L. de Queiroz et al, the disclosure(s) of which are 
incorporated herein. 

FIELD OF THE INVENTION 

The present invention generally relates to watermarking methods and, 

more particularly, to those methods for digital watermarking which embed hidden 

data in a manner designed for the printing process having high information 

channel capacity while not degrading image texture or resolution 

BACKGROUND OF THE INVENTION 
For many reasons, it is desirable to enable data hiding within a digital 

image. Those reasons include image security, authentication, covert 
communication, rendering instructions, and providing additional useful 
information. Some existing digital watermarking methods do not survive the 
printing process as most watermarking methods are designed for use in a non- 
printed contone image. Existing methods designed for contone images are too 
fragile to be encoded into a print and be retrievable. Glyphs and other low 
frequency methods that can be used in a print setting often introduce 
undesirable textures or lower the spatial resolution of the image. 

Many printing processes produce color by combining 4 or more colorants. 
The most common case is the use of cyan, magenta, yellow, and black 
colorants, referred to herein as C,M,Y,K. It is well known that these 4 colorants 
can be combined in many different ways to produce the same color, as 
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perceived by the human visual system. For a given image or application, the 
specific method of combining C, M, Y, K is determined by the gray-component 
replacement (GCR) function. This function is, in general, a multidimensional 
mapping that relates the basic subtractive primaries, C, M, Y to the actual C, M, 
Y, K colorants that constitute the final printed image. A simple and common 
form of the GCR function is defined by a pair of functions: one that relates 
min(C, M, Y) to the amount of K to be added, and another that relates min(C, M, 
Y) to the amount of C, M, Y to be removed or replaced by the K. The two 
functions are dependent upon each other in a way that yields the desired color. 
Note that the quantity min(C, M, Y) is the amount in which all three colorants are 
present, and thus is qualitatively thought of as the "gray component" of the color. 
Functions of this quantity that dictate the amount of K to be added and the 
amount of CMY to be replaced are thus collectively termed "gray component 
replacement". 

In principle, a given image can be rendered with different GCR functions 
to yield visually similar or identical color renditions. Of these a single GCR 
function is generally chosen to render a given image or object within a page. 
The choice is based on a number of factors, including achievable print gamut, 
colorant area coverage limitations, smoothness and texture considerations, 
sensitivity to registration, colorant cost, etc. 

What is needed in the art is a watermarking method designed for use in 
the printing process which does not degrade textures and which has a high 
information channel capacity. 

BRIEF SUMMARY 

What is disclosed is a decoding method for retrieving information bits encoded in 
a printed image comprising the steps of first receiving an input electronic image 
as a scanned version of the printed image. A region of interest in the image is 
then extracted and, for that region, an amount of K colorant present, denoted K H ; 
is obtained. Further, a color value is generated therefrom and the GCR used for 
encoding that region is determined using K H and the obtained color value. 
Encoded information bits are retrieved therefrom based on the determined GCR. 
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The estimated K H is preferably evaluated conditional to a capacity signal K L and 

a luminance signal L. From the obtained data, values of K H , K L , and L, are 

derived wherein K H is estimated from a high resolution scan, and K L and L are 

estimated from a down-scaled image, respectively. The capacity signal K L and 

the luminance signal L are derived from the obtained color value. Further, the 

capacity signal, K L is derived by first applying a suitable operator S to reduce the 

image from scanner resolution to the watermark resolution and then converting 

the obtained color values to CMY estimates such that K L =min(C,M,Y) 

Alternatively, K-capacity is derived from the amount, Kl, y, comprises first 

converting the obtained color values to CMY estimates and applying a suitable 

operator S to reduce the image from scanner resolution to the watermark 

resolution such that K L = min(S(C),S(M),S(Y)); wherein L is described by a linear 

combination of scan signals RGB, such that L = kiS(R) + k 2 S(G) + k3S(B). The 

value of K H is determined by first converting the obtained color values to CMY 

estimates. The estimates determine K-colorant amount at each pixel such that: 

K = min(C,M,Y), A suitable operator S is applied to reduce the image from 

scanner resolution to the watermark resolution. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The preferred embodiments and other aspects of the invention will 

become apparent from the following detailed description of the invention when 

read in conjunction with the accompanying drawings which are provided for the 

purpose of describing embodiments of the invention and not for limiting same, in 

which: 

Figure 1 illustrates the method of the present invention of inserting a 
watermark; and 

Figure 2 illustrates the method of the present invention of extracting the 
watermark. 

DESCRIPTION OF THE SPECIFICATION 
A method is disclosed for encoding or digital watermarking (data hiding) 

for use in the printing process and decoding. Attention is respectfully directed to 

Fig. 1. An input pixel 10 is received in an RGB color space or an other color 
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space which has been converted to CMY values, at 12. Alternatively, the input 
pixel is received as a CMY pixel. 

Two or more GCR functions are loaded into GCR processor 14 (shown 
referenced as GCR-1, GCR-2, GCR-A/). For AM3CR functions, information 
16 to be transmitted is converted therein into A/-ary form. The N-ary form is one 
wherein the information at each pixel can be represented by one of AAlevels. For 
example, if two GCR functions are to be used (Afc=2), the N-ary form for the 
information is binary, so that each pixel is represented by one of two levels [0, 1]. 
One might choose GCR1 = 100%GCR (maximum amount of gray component is 
replaced with K) to encode every 0, and GCR2 = 0%GCR (no K is used to 
replace the gray component) to encode every 1. Expressed analytically, GCR1 
and GCR2 of this simple example perform the transformation: CMYKi = (C- 
min(C,M,Y), M-min(C,M,Y), Y-min(C,M,Y) , min(C,M,Y)); for GCR-1 = GCRi 00 %; 
CMYK 2 = (C, M, Y, 0); and for GCR-2 = GCR 0 o /o ; where C, M and Y on the right 
side of the equations indicate values into the GCR processor 14, and C, M, Y, 
and K on the left side of the equations indicate values output from GCR 
processor 14. 

From the same CMY data input to GCR processor 14, more specifically, 
from the same CMY data at a particular image location, two or more alternative 
CMYK sets can be generated and output from GCR processor 14. The particular 
CMYK set utilized at the given location indicates the information state encoded at 
that location. For instance, using the example of 0% and 100%GCR from above, 
CMYK = (C,M,Y,0) at a particular location would have a 1 encoded at that 
location. 

The information bits 16 are arrayed in a sequence that produces a spatial 
orientation and layout when the CMYK values are printed. For instance, an 8x8 
pixel block of CMY data may receive the encoding of the first A/-ary number 
formed from the information bits. The second 8x8 block may receive the second 
N-ary number, and so on. Different block configurations may be used, or 
encoding may follow other space filling paths, such as along a space filling 
curve. Also, the encoding could be adaptive to image content, so that particular 
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image features, such as letters, graphic objects, pictures, etc., provide the spatial 
support for the encoding process. Since the data is arrayed spatially in the 
image, in some cases it is useful to have a reference point on the page from 
which the sequence of information can be read. The reference point could be a 
corner, a specially designed cross or dot pattern, or some agreed upon image 
feature. 

There are several ways that GCR processor 14 can convert input CMY 
values to output CMYK values. One example of operation processes the 
incoming data at a location only with the GCR function selected by the 
information bits (N-ary number) to be encoded at that location. Another method 
processes the incoming CMY data with all GCR functions, and the information 
bits associated with that location in space select the CMYK data to be used in 
forming the image. The CMYK sets formed thereby are printed to form output 
image 18. The information bits are now embedded onto the output print. 

If the printing path has been properly calibrated, printed CMYK sets, 
associated with GCR functions, will look essentially the same under the 
illuminant for which the calibration was performed. The GCR functions should 
be optimized so that they are easily differentiated from each other in the 
subsequent decoding process, while at the same time, producing colors that are 
visually substantially similar under the calibration illuminant. 

It should be appreciated by one skilled in this art that some regions in the 
output image will be less useful than others for embedding information using the 
present method. This includes regions where little or no GCR is necessary (e.g., 
C=0 or M=0 or Y=0) and very dark regions because it is more difficult to detect 
black pixels from high-resolution scans due to excessive dot overlap. These 
regions may be considered to have low or zero channel capacity for information. 
Various methods can be used to avoid information loss that could occur by 
attempting to encode information in these regions. For instance, the information 
could be encoded redundantly, throughout the image, thereby increasing the 
likelihood of transmitting the information in the print. Otherwise, low information 
capacity regions can be recognized by their input CMY values, and those regions 
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can be skipped in the encoding processing, and encoding would resume when 
the next region was input to GCR processor that had sufficient information 
capacity. 

Although current color management software found in the arts allows 
different GCR functions to be applied to different objects, the purpose therein is 
to enable the optimization of image quality whereas, the present invention uses it 
convey information. The present encoding method is described as spatially 
varying, as in varying across a pictorial image. It should be understood that the 
GCR may also vary with objects on a page. For instance different graphical 
features, letters, vignettes, and the like could each have a different GCR that is 
optimized to carry information. The watermark information can be a tag plane, or 
auxiliary image plane sometimes referred to as an omega channel or alpha 
channel, that is used to select the GCR functions in the encoding. 

With reference now being made to Fig. 2, to decode the embedded 
information, the printed document 20 is scanned into electronic form. The data 
contained therein are converted to RGB color space by this scanning process. 
The K colorant amounts are then estimated from the RGB data. By design, two 
or more colors match visually even though they were produced with different 
GCR strategies and thus have different K amounts for the same colored area. 
Thus, to retrieve K values from RGB data, information in addition to colorimetric 
values is used. In the present invention, the non-overlap properties of rotated 
halftone screens provide this additional information. 

The K-colorant signal, denoted K H , is estimated from a high-resolution 
scan of the RGB image. In one embodiment, this K-colorant signal is evaluated 
conditional to a capacity signal K L which relates the capacity of pixels to possess 
K-toner to luminance L. The importance of deriving K L is to normalize the value 
of K H against some reference point. Different GCR strategies generate images 
that may look different except under some illuminant for which the color 
correction was derived, i.e., illuminant metamerism might occur between the 
viewing illuminant and the scanner illuminant. Thus the GCR (and hence, the 
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embedded bit data) is estimated by analyzing the vector { K L , K H , L}, i.e., the 
value of K H given K L and L. 

In an alternate preferred embodiment, the K-colorant signal, K H , is 
evaluated conditional to a low-resolution colorimetric signal from the image, 
RGB. In other words, RGB is a low resolution measure of the color of a local 
area and provides a measure of the capacity of that area to possess K-colorant. 
The GCR (hence, the embedded bit data) is estimated by analyzing the vector 
{R, G, B, K H }, i.e., the value of K H given R, G, B. 

The scanned RGB values may be inverted to obtain CMY estimates: C=1- 
R, M=1-G, Y=1-B. One can then estimate K H = min(C,M,Y) = 1-max(R,G,B). If 
S is a suitable operator to reduce the image from the scanner resolution to the 
watermark resolution (e.g., a sequence of blurring filters followed by sub- 
sampling), then K H = S(min(C,M,Y)) = S(1-max(R,G,B)). Even though K H 
resembles that for generating K in a 100% GCR strategy, its present use is 
limited to high-resolution scanned data where the printer dots are discernible. 

In an alternate embodiment, scanner RGB values can be converted to an 
estimate of CMY by a calibration of the scanner system. For example, a 3x3 
linear transformation matrix can be used to best relate scanner RGB values to 
actual known amount of cyan, magenta, and yellow used in the print. It will be 
appreciated by those skilled in the art that other transformations can be used to 
relate scanned RGB to printed CMY. This transformation can be used to 
improve the estimation of K from scan RGB in the actual embedded prints. 

In the alternative, spatial properties of the high-frequency image are 
exploited to determine K H . Assuming halftoning with rotated screens, CMY dots 
do not frequently overlap completely in the highlight and mid-tone regions. 
Typically, the amount of overlap of CMY dots covers a small area percentage at 
highlights and mid-tones. If the input image has perfect dots and a scanner that 
has high enough resolution that it is able to perfectly scan the dots and their 
overlaps is used, then the scanned RGB values thereof only assume a small 
number of combinations that depend on the geometry of the dots against the 
scanner resolution. If each of the RGB values for any given pixel in the scan are 
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low enough then that pixel essentially comprises all black color. A black-colored 
pixel can then be assumed a K-dot. This K-dot assumption is a good 
approximation because for low RGB there must be either a K-dot or an 
overlapping of CMY-dots. Because the overlap area of CMY-dots is often small 
in highlights and mid-tones (due to rotated screens) it is therefore probable that 
the black-colored pixel is a K-dot. Additional information, for example, ensuring 
that the pixel has very low chroma, is also useful. Preferably, a combination of 
luminance and chroma can be used to distinguish between a K dot and an 
overlap of CMY dots. 

Additionally, at the dot center, a small K dot on a light background will 
have a lightness that is higher than a larger K dot on a darker background. Thus 
a transformation is necessary to normalize the lightness of the dot based on the 
scanned RGB. This is done as follows. One out of A/-GCRs can be estimated 
by first, in the training session, determining one image or region that was 
processed with each GCR. For each region or image, compute DDi, K L , 
LDDDE(Kh I K L , L, GCR=i), i.e., the average value of K H for each (K L , L) pair. In 
the on-line detection phase, find the DDthat is the closest to the received K H . In 
the binary case, there are only two values to compute, DDI, K L , LDDDand 

□ □□□ K|_, Loan so that it is easier to set up threshold: DDKl, L) = [□□!, K L , 
LDD+ □□□□ K L , LD such that K H is simply compared to a threshold 

□ □K L , L). The array of the thresholds (typically 256x256) is preferably setup 
beforehand to simplify the computation, i.e., detection can be made with one 
look-up and one comparison. Thus the determination of which pixel is 
associated with a K dot depends on the average lightness of the region. 
Because K L relates to RGB in an average colorimetric way and does not 
comprehend microscopic halftone dot geometry, it can be reduced (using S as 
defined) to: K L = 1 - max(S(R ), S(G), S(B)). The luminance value from the low- 
resolution scans is: L = a S(R ) + b S(G) + c S(B). 

In the alternative, for decoding the GCR-based watermark, the function 

□ □i, R, G ,BDDDwhich relates the expected amount of K to the scanned RGB 
signal, is obtained through a separate calibration process, as follows. A set of 
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CMY values is generated, preferably as a 3-dimensional grid in CMY space. 
These values are processed through each of the A/-GCR functions to obtain N 
sets of CMYK values. If other functions, such as ink-limit, and tone reproduction 
curves (TRCs), are applied during nominal printing of images, these functions 
are to also be applied to the CMYK data. Subsequently, from each CMYK data 
set, a target of patches is generated, printed and scanned. The estimate 0D\0 
R,G,BnDDi=1,...,N, of the amount of black colorant in each patch is obtained for 
each of the N scanned targets corresponding to the A/-GCR functions. One 
exemplary method for doing this has been described previously. The black 
estimates are then used to populate a 3-dimensional lookup table (LUT) whose 
inputs are the scanned RGB values, and whose outputs are the N values of □ 
corresponding to the A/-GCR functions. 

Given a scan of an arbitrary printed image, the black estimate K H is 
derived using the same algorithm employed in the aforementioned calibration 
procedure. The RGB values of the scanned pixels are used to index the 3D 
LUT, and K H is compared with each of the N number of □ values. The closest □ 
value determines which of the /V-symbols was encoded at the given image 
location. 

In the case where N=2, one can alternatively fill the 3-D LUT with the 
threshold: □□ R, G, B) = [□□!, R, G, BDDD+ □□□□ R, G, BDDDnVfenDDDThe 
received black estimate K H is then simply compared with DDR, G, B) to 
determine which of the two symbols was encoded at the given image location. 

It is to be appreciated that methods for filling 3-D LUTs given an arbitrary 
sampling of input and output data is well known in the art. See for example: R. 
Bala, DEVICE CHARACTERIZATION . Digital Color Imaging Handbook, Chapter. 
5, by: G. Sharma, Ed., CRC Press, 2003. It is also noted that pre-conditioning 
transforms may be applied prior to the 3-D LUT that increase the efficiency and 
accuracy of the 3-D LUT. For example, a 3x3 pre-conditioning matrix that 
approximately maps scanned RGB to printer CMY can be followed by a 3-D LUT 
that maps the said printer CMY to the D-values. Since the original calibration 
targets are designed as grids in CMY space, such a preconditioning transform 
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should result in a better utilization of nodes in the 3-D LUT. Also it attempts to 
explicitly recover the original GCR, which is a mapping from CMY to K, rather 
than a mapping from scanner RGB to K. 

In an alternative look-up-table method for decoding, the system is similarly 
trained using a test target, and for every pixel, a quintuple: [R, G, B, K, Q] is 
organized where Q is a discrete number telling which GCR was used to print that 
pixel The RGBK hypercube is mapped into one number Q. The training 
method works by simple majority: The RGBK hypercube is divided into N number 
of cells. Then for every pixel in the printed-and-scanned image, KH and RGB 
are determined. The appropriate Q value is associated with the cell. This is 
repeated for each pixel in every target. After entering the values for every cell, a 
histogram of Q values is determined. The most popular Q value is entered at 
that cell location. Then a LUT is constructed with the RGBK values as input and 
Q as output 

Thus each cell maps the RGBK value to its most likely GCR strategy. To 
ensure that no cells are empty, we start by running the above training algorithm 
with partition of RGBK into 2x2x2x2 cells. If any of the cells is empty, a random 
Q value is assigned to it. The process is repeated for the 4x4x4x4 cell partition. 
At the end of the process, any empty cells inherits the Q value from its 
corresponding position in the 2x2x2x2-partition stage. For non-empty cells, the 
4x4x4x4 value is used. The process is repeated for 8x8x8x8. Any empty cells 
inherit the corresponding Q value from the 4x4x4x4 partition stage. The process 
is repeated again to obtain 16x16x16x16 cells, inheriting Q values from the 
8x8x8x8 partition stage. The 16x16x16x16 partition is probably accurate enough 
to map RGBK values to a small number of GCR functions. The run-time 
detection algorithm works as follows. In the high resolution image, estimate K H . 
Reduce K H image to low resolution. Then, reduce RGB scanner data to low 
resolution. For every pixel in the low resolution image, compute the RGBK H 
quadruple. Feed that RGBK H into the cell LUT, retrieve Q which is the GCR 
estimation. 
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While particular embodiments have been described, alternatives, 
modifications, variations, improvements, and substantial equivalents that are or 
may be presently unforeseen may arise to applicants or others skilled in the art. 
Accordingly, the appended claims as tiled and as they may be amended are 
intended to embrace all such alternatives, modifications variations, 
improvements, and substantial equivalents. 
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